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Abstract. We consider random Cayley digraphs of order n with uniformly distributed gener- 
ating set of size k. Specifically, we are interested in the asymptotics of the probability such a 
Cayley digraph has diameter two as n — > oo and k = f(n). We find a sharp phase transition 
from to 1 as the order of growth of /(n) increases past ^/n log n. In particular, if f(n) is 
asymptotically linear in n, the probability converges exponentially fast to 1. 



1. Introduction 

It is well known that almost all graphs and digraphs have diameter two [2] . This result has been 
generalized and strengthened in various directions, of which we shall be interested in restrictions 
to Cayley graphs and digraphs. 

In [7] it was proved that almost all Cayley digraphs have diameter two, and in [6] this was 
extended to Cayley graphs. The random model used in [7j [6] is the most straightforward one: in 
terms of Cayley digraphs for a given group G, one chooses a random generating set by choosing its 
elements among the non-identity elements of G independently and uniformly, each with probability 
2~™ +1 where n is the order of G. Observe that such generating sets have size at least n/2 with 
probability at least 1/2, in which case the corresponding Cayley digraphs automatically have 
diameter at most two. The less trivial part of [7] therefore concerns random Cayley digraphs in 
which the number of generators is at most half of the order of the group. 

This motivates a study of random Cayley digraphs in which the number of generators is re- 
stricted. The fundamental problem here is the following: for which functions / is it true that 
the diameter of a random Cayley digraph of an arbitrary group of order n and of degree f(n) is 
asymptotically almost surely equal to 2 as n tends to infinity? By the well known Moore bound for 
graphs or digraphs of diameter two we know that / has to increase at least as fast as ^/n. However, 
even the case when f(n) = cn for a constant c seems not to have been investigated before and, as 
we shall see, leads to interesting questions in the study of generating functions. 

In order to investigate the above problem one cannot use the model of [7]. Instead, we will 
consider the uniform distribution of subsets of size k in the set of all non-identity elements of a 
given group of order n. A detailed description of the model and the associated parameters is given 
in Section O The probability that a random Cayley digraph of (in- and out-) degree A: on a group 
of order n has diameter 2 will be estimated in Section[3]in terms of a certain combinatorial function 
p(n, k, t) where t is a parameter that depends on the group and It < n. Dependence on the group 
is then eliminated by showing that one can use t = \fyn\ for a suitable constant 7 < 1/2 in the 
estimates. Setting k = f(n) and t = [7 n Ji the probability that a random Cayley digraph has 
diameter two can be studied by means of the asymptotic behaviour of p(n, /(n), L7?ul) as n — ■* 00. 
The two cases of particular interest are f(n) = \cn\ for a fixed constant c with < c < 1/2, and 
f(n) = \n a \ for a fixed constant a such that 1/2 < a < 1. By a delicate asymptotic analysis, in 
Section |4] we prove that in both cases the diameter of a random Cayley graph is asymptotically 
almost surely equal to two. We also consider analogous questions for random Cayley graphs on 
elementary abelian 2-groups. Under this restriction we obtain tighter bounds in terms of p(n, fc, t) 
for the probabilities, which raises an interesting question on the probability evolution if f(n) ~ c^fn 
for a constant c > 1. 
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2. The model 

Throughout, let G be a finite group of order n and let k be a positive integer not exceeding 
n — 1. The set of non-trivial elements of G will be denoted by G* . For a set A and an integer r, 
the symbol (^) will stand for the set of all subsets of A of size r. 

For S £ )j the Cayley digraph on G relative to S, denoted by Cay(G, S), is the fc-valent 
digraph with vertex set G and arc set {(<?, gs) : g G G, s G S*}. The distance d(g, h) from the vertex 
g to the vertex h in Cay(G, S) is the length of the shortest directed path from g to h in Cay(G, S). 
The diameter diam(Cay(G, S)) is the smallest integer d such that for every ordered pair (g, h) the 
distance from g to h is at most d. 

We are now ready to introduce our model for random Cayley digraphs of a given valence. Let 
V(G, k) be the probability space (B, 2 B , P) where B = ( G *) , 2 B is the power set of B, and P is the 
uniformly distributed probability measure on B. Since \B\ = a simple counting argument 

shows that Pr({5}) = (V)" for 

every S G B. More generally, for every subset L C G* of size £, 
the probability that a random set S £ B contains I as a subset is given by 



a, p r(sa ^p r({ , e (-) :ies >,= (»--')(»-)- 

where (r)^ = r(r — 1) . . . (j — I + 1) denotes the £-th descending factorial of r (with the convention 
that ro = 1). We can now define a random variable Diam: S-*Ron the probability space V{G 1 k) 
by letting, for every S £ ) , 



(2) Diam(S) = diam(Cay(G, S)). 

The main goal of this article is to derive bounds on the probability of the event {S G ( . ) : 
diam(Cay(G, S)) — 2} and study the asymptotic behaviour of the bounds. 

Since Cayley digraphs are vertex-transitive, the diameter of Cay(G, S) coincides with the maxi- 
mum value of d(l, y) over all y G G*. Clearly, if 9(1, y) < 2, then y G S, or there exists x G S such 
that (l,x,y) is a directed path from 1 to y of length 2. The latter is equivalent to requiring that 
{x, x~ x y} C S*. This shows that the following events will play an important role: 

Definition 2.1. For x, y G G* , let 

T{x,y) = {S:S£( G *\{x,x- 1 y}^S} and X(y) = |J T(x,y). 

Let 5 be an arbitrary element of ( fc ) . Clearly, there is a directed path from 1 to y of length 2 in 
Cay(G, S) if and only if S G X(y). In other words, S G if and only if there is no directed path 

from 1 to y in Cay(G, S) of length exactly 2. Thus, if diam(Cay(G, S)) > 2 then S G Uj /eG »X(y). 
Therefore we have the following inequality: 



(3) Pr(Diam > 2) < ^ Pr(X(y)). 

yea- 

On the other hand, if diam(Cay(G, S)) < 2, then for every y G G* we have y £ S or S £ X(y). 
Hence Pr(Diam < 2) < Pr(X(y)) + Pr(y G S), and by it follows that Pr(Diam < 2) < 
Pr(X(y)) + -^j, which is equivalent to Pr(Diam > 2) > Pr(X(y)) - This, together with ©, 
shows that 



A: 



(4) M < Pr(Diam>2) < (n-l)M, where M — maxPr(X(y)). 

71—1 y6G* 

The inequality Q provides the basis for our investigation. In what follows we consider estimates 
for the quantity M appearing in . 
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3. The estimates 



The key to deriving bounds on M is the evaluation of the probability Pr(A(y)) = 1— Pt(U x( =g*T(x, 
As Lemma \'3 . 1 1 below shows, this probability is closely related to the values of p(n, k,t), t,k < n, 
where p(n, k, t) is defined by 

Lemma 3.1. Let y G G* and let J C G* \ {y} be a set of size t such that the sets {x, x~ 1 y}, with 
x G J, are pairwise disjoint and of size 2. Then 



Pr(X(y))<p(n,k,t). 
Proof. We start with a simple inequality 



Pr(X(y)) = 1 - Pr(X(y)) = 1 - Pr(U xeG .T(a:, y)) < 1 - Pr(U x6J T(a:, y)). 

Now observe that the set n xe iT(x,y) consists of all those S G ( , ) for which U xe j{a:, x~ 1 y} C S. 
Hence, if I C J and |/| = i, then 

Pr(n xeJ T(z,y)) 



n — 1 — 2?'\ /n — 1\ 1 (fc)2i 



k — 2i J \ k J {n— 1)21 
By the inclusion-exclusion formula, we have 

t 

Pr(U ;ce . / r(x, 2 /)) = ^(-l) 1 - 1 ^ Pr(n xe rT(z,y)) 

'A /n — 1 — 2i\ (n — l x 1 



k-2i 



EC- 1 )'" 1 1 

i=l 



w(«-l)2 

and the result follows. □ 

A straightforward consequence of the above proof is the fact that < p(n, k, t) < 1 and that 
p(n, k, t) is decreasing in t (in the range for which there is an appropriate group G for which the 
above lemma can be used). 

We continue with establishing an upper bound on the parameter t appearing in the sums above. 
For this we need an estimate of the number of 'square roots' of a non-identity element in a group. 
Therefore, for an element y € G let a(y) denote the set of x E G such that x 2 = y. 

Lemma 3.2. If y is a non-trivial element of a finite group G, then \<j(y)\ < f \G\. 

Proof. Suppose the contrary and let y S G* be such that |er(y)| > ||G|. Take an arbitrary element 
z G G, and observe that cr(y z ) = a(y) z . In particular, |cr(y z )| = \a(y)\ > j\G\, implying that 
<r(y) H o~(y z ) is non-empty. Take any x £ o~{y) n a{y z ), and note that y = x 2 = y z . Hence y is in 
the centre of G. Now consider the quotient projection tt: G — > G/(y), let s be the order of y in G, 
and let T denote the set of elements x G Gj (y) such that x 2 = 1. Clearly, w(a(y)) C T. Suppose 
that tt(xi) = tt(x2) for some pair of x\,x% G o~(y), xi,=/= x%. Then X2 = x\y T for 1 < r < s, and so 
y = x 2 = x\y 2r — y 1+2r , implying that s — 2r. This shows that the 7r-preimage of each element 
in T contains at most 2 elements from cr(y), and contains at most one element from o~(y) if s is 
odd. Therefore \T\ > > f\G/(y)\ if s is even, and |T| > \a(y)\ > if s is odd. 

On the other hand, \T\ < \G/{y)\, implying that s = 2 and \T\ > \\G/{y)\. It is known that the 
only groups for which the proportion of the involutions is more than | are the elementary abelian 
2-groups (see [H]). Hence G/(y) is an elementary abelian 2 group, say G/(y) = Zj, where p is 
a positive integer, and consequently, G = or G = Z^" 1 x Z4. However, it is clear that no 

element y in such groups satisfies |<r(y)| > j\G\. □ 
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We remark that the bound in the previous lemma is sharp. For example, if Q = {±1, ±i, ±j, ±fc} 
is the quaternion group and if G = Q x Zjj, then for the element y = (— 1,0, ...,0) we have 

<y) = l\G\. 

We are now ready to prove the main result of this section, which is an upper bound on the 
probability Pr(Diam > 2) in terms of p{n, k, t). 

Theorem 3.3. Let G be a finite group, and let k be such that 1 < k < n = \G\. Then for the 
random variable Diam on the probability space V{G, k) we have 

Pr(Diam > 2) < (n - l)p(n, k, [(n - 4) /12j ). 

Proof. Let y £ G* and let s = \cr(y)\. We first show that there exists a set J C G \ {l,y} of size 
at least t = [ "~g~ s j such that the sets {x,x~ 1 y}, where x £ J, are pairwise disjoint and of size 2. 
We shall define such a set J recursively. 

If s > n — 3, then t = 0, and J = will do the job. So we may assume that s < n — 4. Then the 
set C = G* \ (cr(y) U {y}) is non-empty, and we can choose x\ £ C and set J\ = {xi}. 

Now suppose that J\, . . . , J(_ have been already defined for some I <t, and suppose that this has 
been done in such a way that J, C C, | J,| = i and |U a:e j i {a;, x -1 y}\ = 2i for every i £ {1, . . . , £}. Let 
Ke = U xe j e {x, yx _1 , x~ 1 y}. Then \Kg\ < 3£ < 3t — 3 < n — s — 4<n — s — 2 — \C\. Hence, we can 
choose an element xg + i £ C\Kg, and define Jp + i — Jg U Clearly, \Ji+\\ = | Jt\ + 1 = 1+ 1, 

and since G C, also J^ + i C C. Now suppose that lU^gj^ {x, x~ 1 y}\ < 2£+2. Then one of the 
elements Xf+i and xj^y belongs to U xe j f {a;, However, both cases imply that x^+i G Ki, 

which contradicts our assumption. Hence this construction yields a set J = J t with the desired 
properties. 

From Lemma [3.11 it follows that Pr(X(y)) < p(n, k, [(n — 1 — s)/3j). Then by Lemma \3. 21 we 
have s < |n and therefore (n — 1 — s)/3 > (n — 4)/12. Using Q, and the fact that p(n, k,t) is 
decreasing in t we arrive at the inequality in the statement of the theorem. □ 

If the group G is an elementary abelian 2-group, then the value of M in Q can be expressed in 
terms of p(n, k, t) exactly. This leads to the following result. 

Theorem 3.4. Let G = 1% be an elementary abelian 2-group, and let 1 < k < n = 2 d . Then for 
the random variable Diam on the probability space V{G, k) we have 

k 

pin, k, (n - 2)12) < Pr(Diam > 2) < (n - l)p(n, k, (n - 2)12). 

n — 1 

Proof. We show that for any y £ G* we have Pv(X(y)) = p(n,k, 7 - ! -^ L ). Let J y be any transver- 
sal of the subgroup (y) in G, and let J = J y \ {l,yj. Then Pr(X(y)) = Pr(U xeG *T(x,y)) = 
Pt(U x£ jT(x,y)). On the other hand, the set J satisfies the conditions of Lemma l3~Tl Since 
| J | = we have 

Pr(X^)) = 1 - Pr(U xe jT(x, y)) = p(n, k, (n - 2)/2). 
The statement now follows from (fj]). □ 

It is now clear that knowledge of the asymptotic behaviour of p{n, k, t) would allow us to make 
conclusions about the asymptotic behaviour of the random variable Diam. As explained in the 
Introduction, the most interesting cases to study are f(n) = \cn\ for < c < 1/2 and f(n) — [n a \ 
for 1/2 < a < 1. For example, Theorem 13.31 shows that if lim„_ >00 (n — l)p(n, [cn\, [(n — 4)/12j) = 
for a constant c such that < c < 1/2, then the diameter of a random Cayley digraph of 
order n and degree \cn\ is asymptotically almost surely equal to two. By the same token, if 
lim n _ >00 (n — l)p(n, \ n a \ , |_(n — 4)/12j) = for 1/2 < a < 1, then the diameter of a random Cayley 
digraph of order n and degree \n a \ is also asymptotically almost surely equal to two. Similar 
statements, with n a power of two and with (n — 2)/2 in the third position, hold for random 
abelian Cayley digraphs on the basis of Theorem 13.41 In the next section we will show that the 
above limits are indeed equal to zero and therefore the corresponding random Cayley digraphs 
almost surely have diameter two. Since Theorem 13.41 gives also a lower bound, it is natural to ask 
if, for n a power of two, lim n _,c>o p(n, \ cn 1 l' 1 \, (n — 2)/2) = 1 for sufficiently large c. As we shall 
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see, the answer to this question is in the negative. In what follows we also describe more precisely 
the threshold at which Pr(Diam < 2) jumps asymptotically away from 0. 

4. Asymptotic analysis 

We use a mixture of techniques, based on generating functions, with varying levels of sophis- 
tication. Some of the questions of the previous section are quickly addressed by relatively simple 
means, while for others it is cleaner to apply asymptotic techniques for the analysis of coefficients 
of multivariate generating functions as developed in [9l [10l [HI HI IS] • See [11] for a detailed survey 
of the use of such techniques in combinatorial problems. For the hardest questions we use the 
recently developed machinery of [31 0] . 

The quantity a(n, k, t) :— (T)p{ n + 1) k, t) is simpler to analyse in this way than p(n, k, t) itself. 
It is easily seen from above that a(n,k,t) has a purely combinatorial description. Namely, given 
a set of size n, we choose t disjoint pairs from this set. Then a(n,k,t) is the number of subsets 
of size k that contain none of the pairs. Note that a(n, k, t) = if k + t > n, by the pigeonhole 
principle (since the complement of S has size less than t, S must contain at least t + 1 of the 2t 
paired elements). 

From the statement of the problem, a(n, k,t) is not defined if 2t > n; however, formula ([5]) still 
makes sense in that case, even though it does not define the probability of any event. In fact, 
a(n, k,t) can be negative for large t. Asymptotic analysis in this case is considerably more difficult 
than what is presented below, and we will avoid this case in the present paper, since it is not 
relevant to the original combinatorial question. 

4.1. Generating functions. We first compute the trivariate generating function of a(n,k,t). 
The most direct approach is to use some well-known bivariate generating functions . aijX l y J . 

Throughout, we use the convention that the binomial coefficient (,), with k,l € Z, is zero unless 
< I < k. If ay = then the generating function is (1 — x — y)^ 1 , while that for = Q) is 

(1 — x(l + y)) ■ We now compute 

n,k,t,i \ / \ / N,K,i,j 




1 



x N y K 



1 — z(l + wx 2 y 2 ) 1 — x(l + y) ' 
which yields the trivariate generating function 

(6) G(x,y,z)=Y,a(n,k,t)x n v h z t = - 1 2 ^— - . 

^ t I - z{\ - x 2 y 2 ) 1 - x{\ + y) 

Note that if we impose the restriction 2t < n, then the sum over N is restricted to N > 2j. 
Now summing over N, K,i,j as above we obtain the more relevant restricted trivariate generating 
function 

(7) Gi(x,v,z)= a(n, k, t)x n y k z t = \ J- =: — - — . 

K> U y ' <^<x l-x{l+y)l-zx*{l + 2y) H X H 2 

The series G± is more useful for our purposes, since all coefficients are nonnegative. 

4.2. Basic asymptotic approximations. We list here some standard asymptotic approxima- 
tions that will be used later. 

Lemma 4.1. Write n — Xk with < A < 1. Then 

(8) Q = exp(ni?(A)) P(A) ttT 1 ' 2 C(n, A) 
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where 

R(X) = —A log A — (1 — A) log(l - A) 
P(A) = (2ttA(1- x)y 1/2 
C(n, A) = (1 + Oin- 1 ) + O^nA)" 1 ) + 0((n(l - A))" 1 ) as n -> oo. 
Proof. A direct application of Stirling's approximation. □ 

We call R the exponential rate and P the leading coefficient, while C is the correction term. Of 
course R depends on A and so may vary with n as n — ► oo. 

Lemma 4.2. For i > k > define 
Then with t = Xk we have 

(9) b(t, k) = exp(tR(A)) P(X) C{t, k) 

where 

R(X) := (2 - A) log(l - A/2) - (1 - A) log(l - A)) 
/ o x \ 1/2 

C(A) := 1 + Oit- 1 ) + OiitX)- 1 ) + 0((t(l - A))- 1 ) as i -» oo. 

Proof. An immediate application of the previous lemma (replace n by 2t and A by A/2 in the 
denominator and replace n by t in the numerator). □ 

We also need some standard facts about the stationary phase approximation of an oscillatory 
integral. We recall them below and refer to a standard text such as 13 for details. Define 

/(/;«)= [ b e n ^g(9)de 

J a 

where / and g are smooth functions and 5ft/ > on [a, b]. 

Lemma 4.3 (Laplace approximation). Let I(f;n) be as defined above. Suppose that 5ft/ > on 
[a, 6] except at a single point x £ (a, 6). Furthermore suppose that f'(x) = and f"{x) ^ 0, while 
g{x) 7^ 0. Then 

/(/; n) = exp(n/(x)) -^==== (l + 0(n -1 )) as n -> oo. 

T/ie implied constant in the O-term remains bounded as we vary f and g provided that no hypotheses 
change, x remains in a compact subset of (a, b), f"(x) remains bounded away from zero, and the 
maximum of \g\ remains bounded. 

□ 

4.3. Abelian groups. 

4.3.1. The linear case. The simpler form of the bounds involving p(n, k, t) in the abelian case allows 
an easy elementary approach, which we now present. 

Suppose that t = (n — 2)/2, so that n — 1 = 2b + 1. A direct evaluation as in Section |4~T1 above 
shows that 



and hence we may extract coefficients to obtain 



tk I M i ofe-i ( t 



a(2t+l,k,t) = 2«[ 1+2 _ ;| 
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Now 



2t+l^ 



p(2t + 2,k,t) = ( ) a(2t + l,k,t) 



kj \k 

: [6(t,A)][c(t,fc)] 



(2i-fc + 2)(2f + l-fc) 
(2t-2fc + 2)(2t+l) 



When k = t + 1, the right side above should be replaced by 2', but we do not deal with this case 
below anyway (since k < n/2, n is even and k is an integer we must have k < (n — 2)/2 = £). 
From Q, we obtain the exponential rate of bit, k) with respect to t as 

R(X) = (2 — A) log(l - A/2) - (1 - A) log(l - A) 

with A = k/t. This is easily seen by elementary calculus to be negative for < A < 1. Furthermore 
c(t, k) has exponential rate zero. Thus in combination with Theorem 13.41 we have: 

Theorem 4.4. For any constant c such that < c < 1/2, the diameter of a random Cayley digraph 
on an elementary abelian 2- group of order n and degree \_cn\ is asymptotically almost surely equal 
to two. Furthermore the convergence is exponentially fast. □ 

4.3.2. The sublinear case. We now consider the case where k is of order n a with 1/2 < a < 1. For 
k = Xt with A = o(l) as t — > oo, we have c(t, k) = 1 + 0(A). By ((9]) we again have 

R(X) = (2 — A) log(l - A/2) - (1 - A) log(l - A) = -A 2 /4 + 0(A 3 ). 

Thus if k grows at least as fast as n a with a > 1/2, it is definitely the case that the upper 
bound (2t + l)b(t, k)c(t, k) decays faster than polynomially. Using Theorem 13.41 again, we have the 
following conclusion. 

Theorem 4.5. For any constant a such that 1/2 < a < 1, the diameter of a random Cayley 
digraph on an elementary abelian group of order n and degree \_n a \ is asymptotically almost surely 
equal to two. □ 

Also, the approximation above shows that if k = [cy/n\ then the lower bound b(t, k)c(t, k) 
converges to exp(— c 2 /2), and not 1. This shows that for n a power of two, p(n, [en 1 / 2 ] , (n — 2)/2) 
does not tend to 1 as n — > oo. 

The case of general groups could be attacked in a similar way to the elementary approach 
above, but with more effort. For example, p(n, k, t) is decreasing in t, so that p(n, k, (n — 2)/12) > 
p(n, k, \_(n — 4)/12j ) > p(n, k, (n — 4)/12) for sufficiently large n. As above we can compute the 
bivariate generating function for F(12t + 3, k, t) and F(12t + 1, k, t), and estimate each of these as 
above when k is of order n a . However, this approach depends heavily on the relatively nice formula 
for p(n, k, t) involving well-studied binomial coefficients. For variety, and to illustrate that more 
detailed expansions can be obtained in more generality, we use a different approach in Section [4.41 



4.4. General groups. In this section we use Theorem 13.31 to study the asymptotic behavior of 
the diameter of a general random Cayley digraph of order n and degree k. We again consider two 
different regimes, namely k = \cn\ and k — \n a \ where 0<c< l/2<a< 1. 

Our analysis is in terms of parameter-varying integrals that lead to uniform asymptotic expan- 
sions for the coefficients a(n,k,t), with t = [(n — 4)/12j. We make heavy use of the fact that 
n > 2k and n > 2t for both regimes to reduce the problem to the analysis of a one-dimensional 
parameter-varying integral. For the first regime, the asymptotic behavior of the resulting integral 
relies on the stationary phase method as found for example in [15] . Instead, for the second regime, 
the analysis follows the lines of [3J and [J] to properly use the Laplace approximation of Lemma |4~31 

In what follows it is always assumed that n > 2k > and n > 2t > 0. We first reduce the 
problem to computing the asymptotics of a certain one-dimensional integral. Since a(n, k, t) = 
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x n y z t ]Gi(x,y, z), we obtain 



i- x {i + y y 

oo 

= [x n - 2t y k \Y J x l {l + y)\l + 2yf, 
1=0 

= [/](l + y)»- 2t (l + 2y)*. 
Using Cauchy's formula, the above implies for all r > that 

(10) a(n, k,t) = 1 — [ (1 + re ie ) n ' 2t {l + 2re ie ) t e~ lke d9. 
In particular, we can rewrite 

(11) a(n, fc, <) = (27r) _1 • #(r; ra, k, t) ■ I(r; n, k, t), 
where 

E(r; n, k, t) := r- fc (l + r) n ~ 2t (l + 2r)*, 

l + re ie \"" 2t /l + 2re^ 4 



The integral in (fT0| has been normalized by the factor (1 + r)"~ 2 *(l + 2r)* to emphasize that 
the modulus of each of the two factors in the integrand is maximized at = 0. 

To determine the asymptotic behavior of a(n,k,t) the goal is to tune r with (n,k,t) so that 
I(r;n, k,t) decays polynomially with n, in other words so that E(r;n,k,t) captures the precise 
exponential growth rate of the coefficients a(n, k, t). 

To accomplish our goal, motivated by the stationary phase method, we rewrite the integrand of 
J(r; n, k, t) in an exponential-logarithmic form to obtain 

(12) I(r;n,k,t)= j exp { - n ■ F(6;r, di, d 2 ,d 3 )}d9, 



where 



1 + re ) f 1 + 2re* 

F(9;r,di,d 2 ,d 3 ) :— d 3 ■ i6 — d\ ■ In I — > — e£ 2 • In • 



1 + r i " 1 + 2r 



di 



n-2t 

11 

t 

n 
k 



n 

In what follows all logarithms are to be interpreted in the principal sense. In addition, unless 
otherwise stated, d\ : d 2 and ^3 always stand as short forms of the functions defined above. As a note 
on our terminology, we refer to 7(r; n, k, t) as a parameter- varying integral because F(0; r, d\,d 2 , d 3 ), 
the so called phase term, depends upon the parameter n itself. 

We note for later that the exponential rate of E(r; n, k, t) in these variables is given by 

(13) limsup logE ( r > n ' fc ' *) = -tfelogr + d x log(l + r) + d 2 logfl + 2r). 

n n 

Before analyzing the asymptotic behavior of I(r;n, k,t), we discuss the properties satisfied by 
the phase term that are essential for the application of the Laplace approximation. For this and 
based upon analytic properties to be clarified shortly, observe first that 
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(14) ^(0;r, = iU- ^ ^ 



89 y > ' " " °> [° l + r l + 2r 

d 2 F dir d 2 r 

(15) —- T {0;r,d 1 ,d 2 ,d 3 ) 



Q02 v „, 2(1 + r) 2 (l + 2r) 2 ' 

Thus, in order for = to be a stationary point of F(9; r, d%, d 2 , d 3 ), r and (n, k, t) must satisfy 
the relation d 3 = d\r/(l + r) + 2d 2 r/(\ + 2r). A solution r > to this equation is given by the 
formula 

(16) M3 



(1 - 3d 3 ) + V(l-3d 3 ) 2 + 8d 3 {d 1 + d 2 - d 3 ) 

Note that there is a unique positive solution for r. On the other hand, using that d% + 2d 2 — 1, it 
follows almost immediately that 

(17) - 7 ^r(0;r,d u d 2 ,d 3 )> 



dff 2 v ' ' L ' z ' " ~ 2(1 + 2r) 2 ' 
Similarly, but after using that ln(l — w) < —w/2, for all < w < 1, it follows that 

(18) M{F(9; r, d u d 2 , d 3 )} > 

Thus for given d\,d 2 , d 3 in the right range, F has a single stationary point at 9 = 0, satisfying 
the hypotheses of Lemma l4.3l Furthermore since g — 1 there the Laplace approximation is uniform 
as long as no hypotheses change and r remains bounded away from zero. 

In what follows, unless otherwise stated, r always stands for the short form of the term defined 
in (|16j) . In addition, we write E(n,k,t), I(n,k,t) and F(9]di 1 d 2 ,d 3 ) respectively as a short form 
for E(r; n, k, t), J(r; n, k, t) and F{6; r, di, d 2l d 3 ). 

4.4.1. The linear case. We first study the asymptotic behavior of the coefficient a(n,k,t) for the 
regime where k = [cn\ and t = [(n — 4)/12j , with < c < 1/2. In this case, asn^ 00, d\ — » 5/6, 
d 2 — > 1/12, d 3 — > c and r — > r c , where r c > is the quantity defined as 

(19) r ~ 2c 

(1 - 3c) + ^/(l-3c) 2 + 8c(ll/12-c) 

In particular, if c is bounded away from zero then for sufficiently large n independent of c, r is 
also bounded away from zero. Thinking momentarily of (9;r,di,d 2 ,d 3 ) as a vector of unrelated 
variables, observe that there exists a sufficiently small < 6 < tt such that F(6; r, d\, d 2 , d 3 ) is an 
analytic function of 9 for \9\ < 25, for all (r, d\, d 2 , d 3 ) such that |r — r c \ < 26. Thus by Laplace's 
approximation I(n, k, t) is asymptotically of order n" 1 / 2 asm 00. Hence the exponential rate of 
p(n, k, t) is indeed given by that of (?) 1 E(n, k, t). Using ([8]) and (| 13|) we see that this rate is 

5 1 

- ln(l + r c ) + — ln(l + 2r c ) - cln(r c ) + (1 - c) ln(c) + cln(c). 

6 12 

It is readily computed that the supremum of the exponential rate for < c < 11/12 occurs only 
when c — > + and has value 0. Thus certainly for < c < 1/2, the exponential rate is negative. 
This together with Theorem 13.31 yields the following result. 

Theorem 4.6. The diameter of a random Cayley digraph of order n and degree k is asymptotically 
almost surely equal to two provided that k/n remains in a compact subset of the interval (0, 1/2) 
as n — > 00 . Furthermore, the convergence is exponentially fast. □ 

Of course the same result will hold for larger values of c. Note that when c > 11/12 then for 
large enough n, k + t > n for the value of t considered here and so a(n, k, t) = 0. 

We note in passing that in this case where k is asymptotically linear in n, the analysis of 
the asymptotics of a{n,k,t) is easily accomplished by the recently developed methods of Robin 
Pemantle and Mark Wilson [HI [TO] • The resulting asymptotic expansion can be read off almost 
directly from the explicit expression for G\. We refer to [TTJ Section 4.9] for more details. However, 
the methods of [SI [TO] do not work directly in the sublinear case, unlike the methods of the present 
paper. 
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4.4.2. The sublinear case. Next we study the asymptotic behavior of a(n, k, t) for the regime where 
k = [n a \ and t = [(n - 4)/12j , with 1/2 < a < 1. As before, di -> 5/6 and d 2 1/12. However, 
da — * and therefore r — > as n — ► oo. The new difficulty here is that the phase term of I(r; n, k, t) 
converges uniformly to for all — 7r < 9 < tt as n — > oo. 

To resolve this issue we factor out r, exploiting the fact that F(9;r,di,d2,d 3 ) is also analytic 
with respect to r. Indeed, thinking again of (0; r, di, d 2 , d 3 ) as a vector of unrelated variables, 
observe that there exists 5 > such that F(0; r, di, d 2 , d 3 ) is an analytic function of (0; r) for 
|0| < 2tt and \r\ < 2d, for all (d u d 2 ,d 3 ). Thus since F(0; 0, d x , d 2 , d 3 ) - (0; 0, d x , d 2 , d 3 )0 = 0, 
there exists a function Fi(9; r, d\, d 2 , d 3 ), analytic in (9; r) for \9\ < 2ir and |r| < 25, such that 

dF 

F(8;r,d 1 ,d 2 ,d 3 ) - -^{0;r,d 1 ,d 2 ,d 3 )9 = r ■ F 1 (8;r,d 1 ,d 2 ,d 3 ). 

In what follows we write Fi(9; d\, d2, d 3 ) as a short form for F\(9; r, d\, d2, d 3 ). Reverting to our 
value of r given by ([TB)1 . we see that 

(20) F(9;d 1 ,d 2 ,d 3 ) = r ■ F 1 {8;d 1 ,d 2 ,d 3 ). 
Using (O and (gHl), we obtain 

(21) I(n,k,t)= f e - nr -MS-dud 2 ,d 3 ) d0j 



for all (n,k,t) such that < r < 6. Furthermore, given the factorization in (|2H)l . it follows from 
(fT4|) . p6]) . pT)) and (HHJ) that ^(0; d x , d 2 , d 3 ) = and that 

d 2 Fi 1 
-(0; di,d 2 , d 3 ) > 



2(1 + 2r) 2 ' 

Now it is readily computed from the definition that 
(22) r ~ d a - d 3 {dx - 5/6) - 2d 3 {d 2 - 1/12) + 7d 2 3 /6. 

In particular, given that d\ — > 5/6 and c?2 — > 1/12, we have n-r^n-d 3 ~k^ oo, as rt — > oo. 

Since ^(0; di, d2, is analytic in the disk |0| < 27r, the Laplace approximation can be reapplied 
but this time to determine the asymptotic behavior of the integral on the right-hand side of (I21|) . 
It follows that I(n, k,t) is asymptotically of order (nr) -1 / 2 ~ fc -1 / 2 = 0(n -1 / 2 ). As a result, the 
exponential growth rate of p(n, k, i) is again given by that of (?) E(n, k, t). 

The exponential rate in question is 

d 3 log d 3 + (1 - d 3 ) log(l - d 3 ) - d 3 logr + (1 - 2d 2 ) log(l + r) + d 2 log(l + 2r). 

Using (|2"2"|) we see that as n — > oo this rate is asymptotic to — d 2 /12. When k = [n a \ with a > 1/2 
the exponential part of p(n, k,t) is therefore exp(n( 1 ~ 2a ^ 12 (l + o(l)). With the help of Theorem 
we finally obtain: 



Theorem 4.7. For any constant a such that 1/2 < a < 1, £/ie diameter of a random Cayley 
digraph of order n and degree \ n a J is asymptotically almost surely equal to two. □ 

Note that convergence of the upper bound to zero is faster than polynomial, but subcxponcntial. 

4.5. The threshold. We have not yet answered the original question in the introduction, con- 
cerning the threshold for k = f(n) at which the asymptotic value of Pr(Diam > 2) undergoes a 
phase transition, switching abruptly from 1 to as /c increases. 

The methods above give some useful information on this point. Consider the simpler analysis of 
Section 14. 3[ concerning abelian 2-groups (similar calculations occur when considering the bounds 
for general groups). Assuming that k = o(t), we see that the lower bound has order of growth 
equal to that of b(t,k) as t — > oo. As we have seen, if k = Q(t a ) with a > 1/2, then b(t,k) 
converges to zero faster than any polynomial, so the upper bound converges to zero. To see where 
the upper bound is asymptotically constant, we observe from the approximation that exp(— A 2 i/4) 
must be of order t . Thus we require k w 2^/t logi for this to occur. At this stage the upper 
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bound converges to 2; the more precise k = log t + log 2 yields a limiting upper bound of 1. At 
this stage the lower bound looks like 1/t and converges to 0. The lower bound converges to 1 only 
when k = o(y/i) which is not useful. If k grows faster than y/ 1 log i, the upper bound converges to 
zero. If k grows slower than y/i logt, then the upper bound goes to infinity with t. This gives a 
threshold (rather weaker than we might hope for) in the abelian case. 

In the nonabelian case we can make a similar argument with the upper bound. However we do 
not have a good lower bound on the probability. It may be possible to extract one by refining our 
arguments of Sections [2] and [3] However Robin Pemantle (personal communication) has discovered 
an approach using probabilistic techniques and along these lines that gives sharper results on the 
threshold. Thus we do not proceed further here, preferring to await the appearance of Pemantle's 
work. 

5. Conclusions 

We have derived precise information on the event that a random Cayley digraph has diameter 
2, in the abelian group case, and slightly less precise information in the general case. Many natural 
questions have been answered by our asymptotic analysis of upper and lower bounds on probability. 
An open question concerns the behaviour in the abelian case when k = Cy/n. Our upper bound on 
probability converges to oo and the lower bound to exp(— c 2 /2). Perhaps better bounds will allow 
us to determine the exact limiting probability using methods similar to those in this paper. 

The genesis of this paper may be of interest. PP and the first JS were visiting the second JS in 
Auckland, where they derived the bounds of Section 3 and posed several questions regarding the 
asymptotic behaviour. Their enquiries about asymptotic analysis led from Auckland to Slovenia 
(M. Petkovsek) to Pennsylvania (H. Wilf) and then via Robin Pemantle to ML and MW, the latter 
being blissfully unaware in Auckland of the existence of the work going on in the same building! 
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